Theoretical Error Prediction for a Language Identi cation System using Optimal Phoneme Clustering
نویسندگان
چکیده
using Optimal Phoneme Clustering Kay M. Berkling, Etienne Barnard (berkling,barnard)@cse.ogi.edu Center for Spoken Language Understanding, Oregon Graduate Institute of Science and Technology Abstract A neural network based language identi cation system is described, which uses language independent phoneme clusters as speech units to recognize the language spoken by native speakers over the telephone. We extend our previous work comparing phoneme-cluster and phoneme based approaches to language identi cation [1]. By creating a new speech unit valid across all languages in a theoreticallymotivatedmanner, we circumvent problems that are associated with ne phonemic modelling such as high complexity [4], extensive training requirements [2], and the linguistically arbitrary reduction to subsets of phonemes [4]. A common set of speech units across languages allows us to automatically derive discriminating sequences of any length and theoretically estimate the language identi cationerror. We demonstrateour implemented system for German vs. English on the OGI-TS database.
منابع مشابه
Theoretical error prediction for a language identification system using optimal phoneme clustering
using Optimal Phoneme Clustering Kay M. Berkling, Etienne Barnard (berkling,barnard)@cse.ogi.edu Center for Spoken Language Understanding, Oregon Graduate Institute of Science and Technology Abstract A neural network based language identi cation system is described, which uses language independent phoneme clusters as speech units to recognize the language spoken by native speakers over the tele...
متن کاملLanguage identification with language-independent acoustic models
In this paper we explore the use of languageindependent acoustic models for language identi cation (LID). The phone sequence output by a single language-independent phone recognizer is rescored with language-dependent phonotactic models approximated by phone bigrams. The language-independent phoneme inventory was obtained by Agglomerative Hierarchical Clustering, using a measure of similarity b...
متن کاملFast bootstrapping of LVCSR systems with multilingual phoneme sets
In this paper we described an e cient method to bootstrap continuously spoken, large vocabulary speech recognition systems by multilingual phoneme sets. To evaluate this techniques we collected the multilingual database GlobalPhone which currently consists of 9 di erent languages. A multilingual recognizer (MULTI) based on the four languages German, English, Japanese and Spanish was developed t...
متن کاملLanguage identification of six languages based on a common set of broad phonemes
ON A COMMON SET OF BROAD PHONEMES Kay M. Berkling ([email protected]), Etienne Barnard ([email protected]) Center for Spoken Language Understanding, Oregon Graduate Institute of Science and Technology, 20000 N.W. Walker Road, P.O. Box 91000, Portland, OR 97291-1000, USA ABSTRACT We describe a system designed to recognize the language of an utterance spoken by any native speaker over the te...
متن کاملSECURING INTERPRETABILITY OF FUZZY MODELS FOR MODELING NONLINEAR MIMO SYSTEMS USING A HYBRID OF EVOLUTIONARY ALGORITHMS
In this study, a Multi-Objective Genetic Algorithm (MOGA) is utilized to extract interpretable and compact fuzzy rule bases for modeling nonlinear Multi-input Multi-output (MIMO) systems. In the process of non- linear system identi cation, structure selection, parameter estimation, model performance and model validation are important objectives. Furthermore, se- curing low-level and high-level ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1995